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Abstract 

We study the problem of maximizing a stochastic monotone submodular function with respect to a 
matroid constraint. We study the adaptivity gap - the ratio between the values of optimal adaptive and 
non-adaptive policies - and show that it is equal to . This result implies that the benefit of adaptivity 
is bounded. 

We also study the myopic policy and show that it is a i-approximation. Furthermore, when the 
matroid is uniform, approximation ratio of the myopic policy becomes 1 — j which is optimum. 

1 Introduction 

The problem of maximizing submodular functions has been extensively studied in operations research and 
computer science. For a set A, the set function / : 2-^ ^ M is submodular if for any two subsets S,T C A 
we have 

f{SUT) + fiSnT)<fiS) + f{T) 
An equivalent definition is that the inequality below holds for any S <^ T <Z A and j ^ A 

fiT + j)-fiT)<f{S + j)-fiS) 

where /(• + j) denotes /(• U {j})- Also, function / is monotone if for any two subsets S CT C A: 

fiS) < f{T) 

A wide range of optimization problem that arise in the real world can be modeled as maximizing a 
monotone submodular functions with respect to some constraints. One instance is the welfare maximization 
problem [HI HH US] which is to find an optimal allocation of resources to agents where the utilities of the 
agents are submodular. Submodularity corresponds to the law of diminishing return in economy. 

Another application of this problem is capital budgeting in which a risk-averse investor with a limited 
budget is interested in finding the optimal investment in different projects [241 . The utility function of a 
risk averse investor is submodular. It is also naturally non-negative and monotone. 

Another example is the problem of viral marketing and maximizing influence through the network I14lll8j. 
where the goal is to choose an initial "active" set of people, so as to maximize the spread of a technology or 
behavior in a social network. It is well-known that under many models of influence propagation in networks 
(e.g., cascade model [T^), the expected size of the final cascade is a submodular function of the set of initially 
activated individuals. Also, due to budget limitations, the number of people that we can activate in the 
beginning is bounded. Hence, the maximizing influence problem can be seen as a maximizing submodular 
function problem subject to cardinality constraints. 

Yet another example is the problem of optimal placement of sensors for environmental monitoring |16[ 117] 
where the objective is to place sensors in the environment in order to most effectively reduce uncertainty 
in observations. This problem can be modeled by entropy minimization and, due to the concavity of the 
entropy function, it is a special case of submodular optimization. 
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For the above problems and many others, the constraints can be model by a matroid. A finite matroid 
M is defined by a pair {A, I) , where Z is a collection of subsets of A (called the independent sets) with the 
following properties: 

1. Every subset of an independent set is independent. 

2. If S and T are two independent sets and T has more elements than S*, then there exists an element in 
T which is not in S and when added to S still gives an independent set. 

Two important special cases are uniform matroid and partition matroid. In a uniform matroid, all the 
subsets of A of size at most for a given fc, are independent. Uniform matroids represent cardinality 
constraints. A partition matroid is defined over a partition of set A, where every independent set includes 
at most one element from each set in the partition. 

The celebrated result of Nemhauser et al. |20j shows that for maximizing nonnegative monotone sub- 
modular functions over uniform matroids, the greedy algorithm gives a (1 — ^ ~ 0.632)-approximation of 
the optimal solution. Later, they showed that for optimizing over matroids, the approximation ratio of the 
greedy algorithm is i. Recently, Calinescu et al [5] proposed a better approximation algorithm with ratio 
1 — i. It also has been shown that this factor is optimal (in the value oracle model), if only a polynomial 
numljer of queries is allowed [ini . 

However, these algorithms are designed for deterministic environments. In practice, one must deal with 
the stochasticity caused by the uncertain nature of the problem, the incomplete information about the 
environment, etc. For instance, in welfare maximization, the quality of the resources may be unknown in 
advance, or in the capital budgeting problem some projects taken by an investor may fail due to unexpected 
events in the market. As another example, in viral marketing some people in the initial set might not adopt 
the behavior. Also, in the environmental monitoring example, it is expected that a non-negligible fraction 
of sensors might not work properly for various reasons. 

All these possibilities motivate the problem of stochastic submodular maximization. In the stochastic 
setting, the outcome of the elements in the selected set are not known in advance and they will be only 
discovered after they are chosen. 

1.1 Problem Definition 

In defining the problem, we need to use some care to maintain generality. Consider a set A = {^"1, • • • , Xn} 
of n independent random variables over a domain A. The domain varies depending on the application. For 
instance, in the welfare maximization problem, Xi denotes the quality of the resource or in viral marketing 
Xi corresponds to the set of people who arc influenced by person i. The distribution of each Xi is potentially 
different and is given by a function gi. 

Let Xi denote a realization of Xi. Also, let vector s =< ii, • ■ • ,i„ > denote a realization of set S (Z A^ 
where Xi = Xi for Xi ^ S and Xi = for i ^ S. For a given function / : A" — > ]R+, we can define the 
stochastic function F : A ^ IR.+ as F{S) = E[/(s)], where s is a realization of S and the expectation is taken 
with respect to the product distribution defined by g^'s. 

Also, consider a subset T G A, and a realization t of T. We can define a conditional expectation E[/(s)|t]. 
In the distribution imposed by conditioning on t, Si = ti if its corresponding random variable is in 5 fl T . 
Otherwise Si is chosen independently with respect to the distribution defined by giS. Let us denote this 
conditional expectation with F(S,t). 

We call the set function F stochastic monotone submodular if F{-, t) is monotone submodular for every t. 
Observe that if / is monotone submodular then F is stochastic monotone submodular because it is a convex 
combination of monotone submodular functions. 

Remark: We assume that either we can compute the value of function up to a desired degree of accuracy 
explicitly, or F is given to us via an "oracle" . This is a natural assumption for all the applications mentioned 
in the paper. In fact, in most cases the expectations can be computed simply by using sampling. For 
example, sampling works when the probability distribution functions are constant Lipschitz continuous, or 
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when their support is a polynomial size set of discrete values. In both cases, a small o(l) error is introduced 
in the calculations that we ignore in the rest of the paper. 

Definition: [Maximizing a stochastic monotone suhmodular function] The set A — {Xi,--- ,X„} of n 
independent random variables, the matroid A4 ~ {A,I), and the stochastic monotone submodular set 
function F : 2^ ^ R+ are given. Find a subset S El that maximizes F, i.e., maxggi E[F(S')] where the 
expectation is taken over the probability distribution of the sets chosen by the policy. 

A special case of the above problem is the stochastic max k-cover problem which is defined as follows. 
Suppose a collection A of random subsets of = {1, 2, • ■ • , m} are given. Each element E Ais a random 
subset of N, and it is distribution is denoted by a probability distribution g^. In the stochastic maximum 
fc-cover problem, the goal is to choose k elements of A such that their union has the maximum cardinality. 
We discuss this problem in more detail in Section [TT] 

For the problem of maximizing a stochastic monotone submodular function, we study two types of policies: 
adaptive and non-adaptive. A non-adaptive policy is represented by a fixed subset of A. An adaptive policy 
is a decision tree. It assumes that the value of each random variable can be observed as soon as it is chosen 
and it uses the observed values of the previously chosen elements to determine the next element in the subset. 

We compare these policies by studying the adaptivity gap of the problem. The adaptivity gap is defined 
as the ratio between the expected values of optimal adaptive and non-adaptive policies. Adaptivity gap has 
been previously studied for stochastic maximization problems with respect to covering p3, and packing [7l[8] 
constraints. 

1.2 Results 

We present approximately optimal policies for the stochastic monotone submodular maximization problem. 
First, in Section^ we compare the performance of the optimal adaptive and non-adaptive policies. Although 
non-adaptive policies may not perform as well as adaptive ones, they are particularly useful when it is difficult 
or time consuming to discover the outcome of an element. For example, in the capital budgeting problem, 
it is not possible for the investor to wait until the end of each project to measure the success, or in the 
environmental monitoring problem, it is not practical to measure the performance of sensors after placing 
each sensor in the environment. 

Surprisingly, we learn that the adaptivity gap of the problem is equal to « 1.59. In other words, 
there exists a non-adaptive policy which achieves at least fraction of the value of best adaptive policy. 
This result leads to a (^^)^ ~ 40% approximation of the optimal adaptive policy by a non- adaptive policy 
that runs in polynomial time in n. We also give an example to show that our analysis of the adaptivity gap 
is tight. For that, we use a simple instance of the stochastic max fc-cover problem. 

In Section [21 we focus on natural myopic policies. We study the natural extension of the myopic policy 
studied in [5j in a stochastic environment. This policy iteratively chooses an element with the maximum 
expected marginal value, conditioned on the outcome of the previous elements. 

We show that the approximation ratio of this policy with respect to the optimal adaptive policy is ^ for 
general matroids. We also prove that over uniform matroid (i.e., subject to a cardinality constraint), the 
approximation ratio of this policy is 1 — ^.0 Due to the results of [inilinii the approximation ratio of 1 — i 
is optimal only if a polynomial number of oracle accesses is allowed. 

The closest work to ours in the literature is by Chan and Farias [3]. They mainly study the problem of 
stochastic submodular optimization over partition matroids. In their model, there is an ordering over the 
partitions and any adaptive policy has to choose one element from each partition according to the given 
order. They present a ^-approximation of the optimal adaptive policy (that respects the ordering) using a 
myopic policy. In our setting, we do not have a fixed ordering. In addition, we prove most of our results for 
general matroids. 

The results for the uniform matroid has appeared in a preUminary version of this work [3] . 
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2 The Adaptivity Gap of Stochastic Submodular Optimization 
Problem 



In this section, we analyze the optimal adaptive and non-adaptive policies and compare the performance 
of the two. First, observe that since non-adaptive policies do not observe the realized value of the items 
until the end, they may choose all the elements in one step. In other words, any non-adaptive policy can be 
represented by the set of chosen elements. 

On the other hand, an optimal adaptive policy selects the elements based on the realized values of the 
previously chosen elements. Note that the policy knows the probability distribution of the values of the 
elements that are not yet chosen, but not their actual values. 

Although an adaptive policy can clearly perform better than a non-adaptive policy, we show that its 
advantage is limited. The main result of this section is as follows: 

Theorem 1 The adaptivity gap of the stochastic monotone submodular maximization problem is equal to 



In order to prove the above theorem, we start by establishing an upper bound on the adaptivity gap. In 
Section [2.11 we give an example that shows our analysis of the adaptivity gap is tight. 

Before proving the theorem, observe that since F{S) is a submodular function, we can use the following 
result of Calinescu et. al. [5]: 

Theorem 2 (Calinescu et al [5]) Given oracle access to F (see Remark 1), there exists a polynomial time 
algorithm that achieves an approximation ratio o/ 1 — i — o(l). 

The above theorem immediately implies that 

Corollary 3 A {l — ^—o{l))- approximation of the optimal non-adaptive policy can be computed in polynomial 
time. 

Theorem [T] and the above corollary imply that: 

Corollary 4 There is a policy that is non-adaptive and also runs in polynomial time and computes a solution 
that is within (^^)^ of the optimal adaptive policy. 

In the rest of this section, we prove Theorem [T] The proof is inspired by the techniques developed in 
Section 3.5 of [21j for submodular optimization [in a non- stochastic setting). For the sake of consistency, we 
use the same notation as [H] wherever possible. 

We start by making a few observations about adaptive policies. First, any adaptive policy can be 
described by a (possibly randomized) decision tree in which at each step an element is being added to the 
current selection. Consider an arbitrary adaptive policy Adapt. Each path from the root to a leaf of this 
tree corresponds to a realization s G / of the sequence of elements chosen by Adapt. Here, / denotes the set 
of all possible realizations of sets in X. Let y =< yi, ■ ■ ■ ,yn > represent the probability that each element 
of A is chosen by Adapt, i.e., yi is the probability of choosing Xi. These probabilities sum up to 1. Also, 
let (3s denote the probability density function for outcome s G /. Then, we have the following properties: 
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The first two properties hold because /3 defines a probabihty measure on the space of all feasible outcomes. 
The third property implies that the probability that we observe outcome Xi (a realized value of Xi) among all 
possible outcome s is equal to the probability that Xi is chosen (i.e., iji) multiplied by the probability that the 
outcome is equal to Xj. This property holds because of the independence among the random variables. Since 
every policy satisfies the above properties, we can establish an upper bound on the value of any adaptive 
policy. Hence, we define the function /+ : [0, 1]" ^ M as follows: 



f~^{y)^ sup I / asf{s): / = 1, > 0, Vi, : / asds = yigi{xi)dxi> 

a KJsei Js Js,Si£dxi ) 



(1) 



Another observation is that for an optimal adaptive policy, vector y described above is in the base 
polytope of 7W (defined as follows). A set 5 G X is called a basis for the matroid if \S\ = max{|r| : T G /}. 
The base polytope, B{A4), is defined as: 

B{M) = conv{ls|5 € J, S* is a basis} 

Here "conv" denotes the convex hull and Is is the characteristic vector of S, i.e., 1 for elements in S and 
for other elements. 

Lemma 5 The expected value of the optimal adaptive policy is at most 'aiaXyi=B(^M){f~^iy)}, 

Proof : Note that an optimal adaptive policy only chooses independent sets. Due to monotonicity, all of 
these are independent sets are bases of the matroid. Hence, for an optimal adaptive policy vector y defined 
above is in B{M). Moreover, the expected value of the adaptive policy is bounded by f^{y), because the 
policy has to satisfy the 3 properties mentioned earlier. □ 

Now, we define an extension of set function F{S) to the domain of real numbers. For vector y G [0, 1]", 
let Y denoted a random set where Y includes Xi G A with probability yi. With abuse of notation, we define 
the extension F : [0, 1]" IR+ as follows: 

F{y) = E[F{Y)]^ ^ (lly.l[{l-yAF{Y). 

Y is a basis of I \ieY i(Y ) 

Function f^{y) sets an upper bound on the adaptive policies. We now establish a lower bound on the 
value of optimal non-adaptive policies via the following lemma from |21j (Lemma 3.4), which is based on 
pipage rounding [T]. 

Lemma 6 121}/ Any vector y G B(M) can be rounded to an integral solution S of value F(S) > F(y). 

To complete the proof we need to show that for any vector y, the values of F{y) and f~^{y) are within a 
constant factor of each other, which is established by combining Lemmas 3.7 and 3.8 from [21] . 



Lemma 7 f21f For any monotone suhmodular function f and any vector y we have 

f^{y) < {^)F{y) 



Proof : [Theorem [T] Lemma [5] shows that maxyg5(^) f^iv) is an upper-bound on the performance of 
the optimal adaptive policy. Consider y* G argmaXj,g^^^^/+(2/). By Lemma [71 we have F{y*) is at least a 
(1 — i) fraction of the expected value of an optimal adaptive policy. On the other hand. Lemma [H] implies 
that there exists a S" G X such that F{S) > F{y*). Note that F{S) is in fact the expected value gained by a 
non-adaptive policy that selects set S. Hence, S is a (1 — ■i)-approximation of the optimal adaptive policy. 
By Proposition [TUl in the next section, this factor is tight. □ 
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2.1 A Tight Example: Stochastic Maximum fc-Cover 

Given a collection A of the subsets of = {1, 2, • • • , n}, the goal of the max k-cover problem is to find 
k subsets from A such that their union has the maximum cardinality |10j . In the stochastic version, the 
subset that an element of A would cover is revealed only after choosing the element, according to a given 
probability distribution. 

The following reduction shows that this problem is a special case of the stochastic monotone submodular 
maximization. For S G A, let F{S) denote the expected number of elements covered by the subsets in S. 
Clearly, F is monotone. Consider two subsets 5 C T C ^, an element X G A, and a realization y of an 
arbitrary subset of A. Because UAes^ ^ ^BerB, for every realization y, we have ^(5* + X) — F{S) > 
F(T + X) - F{T). In addition, X = (^, {S* C ^ : l^l < k]) forms a uniform matroid. Therefore, the 
stochastic max fc-cover problem is in fact a stochastic monotone submodular maximization problem. 

In this section, we define an instance of stochastic max fc-cover problem that gives a lower bound on the 
adaptivity gap. This example has been brought to our attention by Vondrak |22| . 

Consider the following instance: a ground set N — {1,2,- ■• , n} and a collection A ~ {X^^\\ < i < 
n, 1 < J < n?} of its subsets are given. For every i, j, define xj*-* to be the one-element subset {i} with 
probability i and the empty set with probability 1 — ^. The goal is to cover the maximum number of the 
elements of N by selecting at most k = n? subsets from A. 

Lemma 8 The optimal non-adaptive policy is to pick n subsets from each of the collections A^^'^ — {X^^^\l < 
j < n^} for every i. For large enough values of n, the expected value of this policy is (arbitrarily close to) 

Proof : Consider an arbitrary non-adaptive policy which picks S, containing sets from A. For each i, 
define ki — |S'n^'-*''|. Moreover, each element i £ N is covered if and only if at least one of its corresponding 
chosen subsets are realized as a non-empty subset. Hence, it will be covered with probability 1 — (1 — ^)*''. 
Therefore, the expected value of this policy is -'^ ~ (1 ~ Note that 1 — (1 — is a concave 

function with respect to x, and also ki = . Hence, the expected value of the policy is maximized when 
ki = k2 = ■ ■ ■ = kn = n. In this case, the expected value is (1 — (1 — ^)")"- ~ (1 — for large n. □ 

We now consider the following myopic adaptive policy V: Start with i — 1 and pick the elements of 
one by one until one of them is realized as {i} or all of elements in A^^^ are chosen. Then increase i by one. 
Continue the iteration untill i ^ n + 1. 

The following lemma gives a lower bound on the number of elements in N covered by the adaptive policy. 

Lemma 9 The expected number of elements in N covered by V described above is (1 — o{l))n. 

Proof : Let Xk be the indicator random variable corresponding to the event that the subset chosen at the 
fc-th step is realized as a non-empty subset for any 1 < fc < n^. Note that the number of elements covered 

2 

by ^ is X]fc=i ^fc- Moreover, all Xk's are independent random variables. 

By the description of V, as long as X^iLi ^fe < '^^i will be one with probability ^ and will be 
zero with probability 1 — Also, when ~ "-j have already covered all the elements in N. 

Therefore, Xt+i, • • • , X„2 will all be equal to zero. With this observation, we define i.i.d random variables 
Yi, 12, • ■ • , Yn^, where each Yi is set to be one with probability i and zero with probability i. Observe that 
min{n, F — X^fe^fe} ^^'^ same probability distribution as '^i,Xk- Note that E[Y] = n. Using Chernoff 
bound, we have 

Pr[r < n - < g""^ = e""'^'. 
Thus, with probability at least 1 — e~" we have Y > n — v?^^ . Hence, 

Xk\ = E[min{n, Y}\ > (1 - e~"''')(n - n^l^) = n- o(n), 

k=l 
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The myopic adaptive policy: 

Initialize i = 0, 5o = 0, ?7o = 

While (A^UtUSt) 

i ^ < + 1 
5*4 ^ St-i 
Repeat 

Select Xi e a.rgma,Xx^fzA\{Ut-iuSt^i)H^i^t-i + Xi)\st-i] 
If St-i U{Xi}^I then 

Ut ^ Ut-i U {X,} 

else 

^ 5t_i U {X,} 
Ut ^ Ut-1 

Observe Xi and update St 
Until (A = Ut-i U or {St + St-x) 



which completes the proof of the lemma. □ 

By combining the results of Lemmas |8] and [9] we have the following proposition: 
Proposition 10 For large enough n, the adaptivity gap of stochastic maximum coverage is at least ^rry- 

3 Approximation Ratio of Simple Myopic Policies 

In this section, we present an adaptive myopic policy with an approximation ratio of ^ with respect to an 
optimal adaptive policy. In Section 13. li we show that the myopic policy achieves the approximation ratio of 
1 — i if the matroid is uniform. Note that even if the actual values were known, the problem of computing 
the optimal policy is intractable. As mentioned before, the maximum /c-cover is a special case of our problem 
and Feige |10 , has shown that it is not possible to find an approximation ratio better than 1 — ^ for the 
maximum fc-cover problem, unless NP C TIME{n'^^^°>i^°sny-j^ 

The policy is given in the above figure. At each iteration, from the elements in A that are not yet 
considered, the policy chooses an element with the maximum expected marginal value. We denote by St 
the set of elements chosen by the adaptive policy up to iteration t. Let st denote the realization of all 
these elements. Also, Ut is the set of elements considered but not chosen by the policy due to the matroid 
constraint. Here is the main result of this section. 

Theorem 11 For general matroids, the approximation ratio of the myopic adaptive policy with respect to 
any optimal adaptive policy is ^• 

Define At = F{St) — F{St-i)- Also, let k be the number of elements chosen by the myopic policy (which 
is simply the rank of the matroid A4). The basic idea of the proof is similar to Fisher et al. [12]. But, 
the main difficulty is that the realized values of At are not always decreasing (due to the stochastic nature 
of the problem). In addition, the sequence of elements chosen by the optimal adaptive policy is random. 
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However, E[At|st_i] > E[At-(.i |st_i] (Note that E[At|st] > E[At+i|st] does not necessary hold). Based on 
this observation, we prove the theorem. We will also use the following lemma from [12]. 

Lemma 12 /i^/ Fort =!,■■■ ,k, we have X^Li < t. 

Note that T, U, and S are random sets, but the lemma holds for every realization because it is a 
consequence of the matroid constraint, not the realizations of the element chosen by the policy. 
We are now ready to prove the theorem. 

Proof : [Theorem lll| Let P be the (random) set of elements chosen by the optimal adaptive policy. 
Also, for t = 1, ■ ■ ■ , k, define Ct = P n (C/t+i \ Ut). Consider a realization sj of St- Because F is stochastic 
monotone submodular we have 

E[F(P)|s,] < E[ ^ F{S + l)\st] 
ieP\s 

The expectations, and in the rest of the proof, are taken over the probability distribution of all realizations 
of P such that the realized values of elements in P n S'f are according to st ■ Since the above inequality holds 
for all St, we have 

E[P(P)] < EiY, F{S + l)] 

ieP\S 

E[P(P)] - E[F{S)] < E[ ^ (FiS + 1)- P(5))] 

ieP\s 

Note that Ut=i Ct^P\S. Hence, 

k 

E[P(P)]-E[P(5)] < J2^[J2iFiS + l)~F{S))] 

t=i leCt 

By expanding the expectation we have 

E[P(P)] - E[P(5)] < E/ E[J2F{S + l)-FiS)\st^^]PT[st-i]dst-i (2) 

t^lJst-i-.St-i&I 

Observe that conditioned on st_i, because the myopic policy chooses an element with the maximum 
marginal value, we have At > F{S + 1) — F{S), I G C(. Therefore, 

E[^ F{S + F{S)\st-i] < E[^ At\st-i] 
leCt idCt 

By plugging the above inequality into ((2|), we get 

k 

E[P(P)] - E[P(5)] < E/ E[^ At|st_i]Pr[st_i]dst_i 

Using telescopic sums and the linearity of expectation we derive the following. Here A/j+i is defined 0. 

E[P(P)] - E[F{S)\ < Y.I E(^- - Pr[st-i]rfst_i 

t=i-' st^i-.St-iei i(=Ct j=t 

k j „ 

= EE/ E[^(A, -A,+i)|st_i]Pr[st_i]dst_i 
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Note that by using the Bayes' theorem and the law of total probability, for every t and j the integral 
term in the above is in fact equal to E[^jg(^^(Aj — Aj+i)]. Now, we can change the probability measure to 
calculate this expectation from st-i to Sj-i- Hence, we have 



k j 



E[F(P)] - E[F(5)] < f E[^(A,-A,+i)|,s,_i]Pr[s,_i]ds,_i 

k j ^ 

= EE/ (i?[|a|E[A,-A,+i|s,_i]|s,_i])Pr[s,_i]ds,_i 

j=l t=l •> Sj-i'-Sj-i&I 



Note that conditioned on Sj_i, the term E[Aj — Aj+i|sj_i] is by definition a constant and we can take it 
out from the outer expectation. Hence, 

E[i^(P)] - E[i^(5)] < E / ( eE \Ct\\sj-i\mj - A,+i|s,_i] ) Pr[s,_i]ds,_i 

We now use Lemma W2\ which implies that in every realization X]t=i 1^*1 — J- We also use the fact that due 
to the submodularity and the rule of the policy, we have E[(Aj — Aj+i)|sj_i] > 0. We conclude that 

nF{P)\-nF{S)\ < E/ jE[(A,-A,+i)|s,_i]Pr[s,_i]ds,_i 



3 

k 



E / E[A,|,s,_i]Pr[,s,_i]ds,_i 



= Ee[A,] 

Therefore, E[F{P)] < 2E[F{S)], as desired. □ 

Fisher et al. [T2j have shown that even in the non-stochastic setting, in the worst-case, the approximation 
ratio of the greedy algorithm (hence the myopic policy) is equal to 5. Also, it is easy to see that that if A4 
is an intersection of k matroids, then the approximation ratio of the myopic policy is equal to 

3.1 Uniform Matroids 

In this section we show that the myopic policy described in the previous section has a better approximation 
ratio if the matroid is uniform. 

Theorem 13 Consider the adaptive myopic policy that at each step selects an element with the maximum 
marginal value, conditioned on the realized value of the previously chosen elements. Over uniform matroids, 
the approximation ratio of this policy compared to the optimal adaptive policy is 1 — ^. 

The proof presented here is similar to the proof of Kleinberg et al.[15j for submodular set functions. 
The main technical difficulty in our case is that the optimal adaptive policy here is a random set whose 
distribution depends on the realized values of the elements of A. 

Proof : Let P denote the (random) set chosen by an optimal adaptive policy. Also, denote the marginal 
value of the t-th element chosen by the myopic policy by Aj, i.e.. 

At = F{St) - F{St-i) 
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Consider a realization st of St- Because F is stochastic monotone submodular, we have 

nF{P)\st] < E[FiP U St)\st] < E[F{St) + J^iFiSt + I) - F{St))\st]. (3) 

The above expectations are taken over all realization of P such that the realized values of elements in 
St ri P are according to St- Because the myopic policy chooses the element with maximum marginal value, 
for every j, 1 < j < A; (fc is the rank of A^), we have 

nAt+i\st] > nF{St +1)- F{St)\st] 

Therefore, we get 

E[FiP)\st] <E[F{St) + kAt+i\st] 

Since the above inequality holds for every possible path in the history, by adding up all such inequalities 
for alH, < i < fc — 1, we have: 

E[F(P)] < E[F{St)] + kE[At+i] 

= E[Ai + • • • + At] + kE[At+i] 

We multiply the t-th inequality, < t < k — 1, by {1 — ^)'^^^ *, and add them all up. The sum of the 
coefRcients of E[F(P)] is equal to 

fc-i 1 k-i 1 - n - i^fe 1 

Ed - 1^'-' = Ed - ly = \ a M ^ - - k^'^ 

t=Q t=0 ^ k> 

On the right hand side, the sum of the coefficients corresponding to the term E[A4], 1 < t < fc, is equal 

to 

j=t j=0 

= fc(l-^)'^-* + fc(l-(l-if-*) 

= k (5) 

Thus, by inequalities dU and (O we conclude 



(1 - (1 - jr)E[FiP)] < EE[At] = E[FiS, 



t=i 



Hence, the approximation ratio of the myopic policy is at least 1 — -. □ 



Acknowledgement We would like to thank Jan Vondrak for fruitful discussions. 
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